Search for: All records

Creators/Authors contains: "Chaudhari, Pratik"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full-text articles may not be available free of charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from those of this site.

  1. We develop information-geometric techniques to analyze the trajectories of the predictions of deep networks during training. By examining the underlying high-dimensional probabilistic models, we reveal that the training process explores an effectively low-dimensional manifold. Networks with a wide range of architectures and sizes, trained using different optimization methods, regularization techniques, data augmentation techniques, and weight initializations, lie on the same manifold in the prediction space. We study the details of this manifold to find that networks with different architectures follow distinguishable trajectories, but other factors have a minimal influence; larger networks train along a similar manifold as that of smaller networks, just faster; and networks initialized at very different parts of the prediction space converge to the solution along a similar manifold (see the first sketch after this list).
  2. We develop information-geometric techniques to understand the representations learned by deep networks when they are trained on different tasks using supervised, meta-, semi-supervised, and contrastive learning. We shed light on the following phenomena that relate to the structure of the space of tasks: (1) the manifold of probabilistic models trained on different tasks using different representation learning methods is effectively low-dimensional; (2) supervised learning on one task results in a surprising amount of progress even on seemingly dissimilar tasks; progress on other tasks is larger if the training task has diverse classes; (3) the structure of the space of tasks indicated by our analysis is consistent with parts of the WordNet phylogenetic tree; (4) episodic meta-learning algorithms and supervised learning traverse different trajectories during training but they fit similar models eventually; (5) contrastive and semi-supervised learning methods traverse trajectories similar to those of supervised learning. We use classification tasks constructed from the CIFAR-10 and ImageNet datasets to study these phenomena (see the second sketch after this list). Code is available at https://github.com/grasp-lyrl/picture_of_space_of_tasks.
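
The first abstract embeds each training checkpoint's predictive distribution into a common space. Below is a minimal sketch of one way such an analysis can be set up, assuming softmax predictions recorded on a fixed probe set and an InPCA-style eigen-embedding of pairwise Bhattacharyya distances; the function names, array shapes, and use of NumPy are illustrative assumptions, not the authors' code.

```python
# Minimal sketch (not the authors' code): embed training trajectories of
# predictive distributions into a low-dimensional space, in the spirit of
# InPCA with the Bhattacharyya distance. All names below are hypothetical.
import numpy as np

def bhattacharyya_distance(p, q, eps=1e-12):
    """Bhattacharyya distance between two sets of per-sample predictive
    distributions p, q of shape (num_samples, num_classes), averaged over
    samples so the quantity stays well-behaved as the probe set grows."""
    affinity = np.sum(np.sqrt(p * q), axis=-1)   # per-sample Bhattacharyya coefficient
    return -np.mean(np.log(affinity + eps))

def embed_trajectory(preds, dim=3):
    """preds: array (num_checkpoints, num_samples, num_classes) of softmax
    outputs recorded along training. Returns (num_checkpoints, dim) coords.
    Double-centers the pairwise distance matrix and keeps the eigenvectors
    with the largest |eigenvalue| (InPCA permits negative eigenvalues,
    i.e. a Minkowski-like embedding space)."""
    T = preds.shape[0]
    D = np.zeros((T, T))
    for i in range(T):
        for j in range(i + 1, T):
            D[i, j] = D[j, i] = bhattacharyya_distance(preds[i], preds[j])
    J = np.eye(T) - np.ones((T, T)) / T          # centering matrix
    W = -0.5 * J @ D @ J
    vals, vecs = np.linalg.eigh(W)
    order = np.argsort(-np.abs(vals))[:dim]      # largest |eigenvalue| first
    return vecs[:, order] * np.sqrt(np.abs(vals[order]))

# Usage: stack softmax outputs on a fixed probe set at several checkpoints,
# then inspect the low-dimensional trajectory coordinates.
rng = np.random.default_rng(0)
logits = rng.normal(size=(20, 500, 10))          # 20 checkpoints, 500 samples, 10 classes
preds = np.exp(logits) / np.exp(logits).sum(-1, keepdims=True)
coords = embed_trajectory(preds)
print(coords.shape)                              # (20, 3)
```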
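
For the second abstract, a similarly hedged sketch: distances between tasks can be probed through the models they induce, by evaluating one trained model per task on a shared probe set and clustering the resulting distance matrix; this is the kind of structure one would compare against a WordNet-like hierarchy. The shapes, the averaged Bhattacharyya distance, and the SciPy average-linkage clustering here are illustrative choices, not the paper's exact pipeline.

```python
# Minimal sketch (assumptions, not the paper's pipeline): build a task-space
# distance matrix from per-task model predictions on one shared probe set,
# then hierarchically cluster it. Names and shapes are illustrative.
import numpy as np
from scipy.cluster.hierarchy import linkage, dendrogram

def bhattacharyya_distance(p, q, eps=1e-12):
    """Average Bhattacharyya distance between per-sample predictions."""
    return -np.mean(np.log(np.sum(np.sqrt(p * q), axis=-1) + eps))

def task_distance_matrix(preds_by_task):
    """preds_by_task: array (num_tasks, num_samples, num_classes) holding the
    softmax outputs of one model per task, all evaluated on the same probe
    set. Returns a symmetric (num_tasks, num_tasks) distance matrix."""
    K = preds_by_task.shape[0]
    D = np.zeros((K, K))
    for i in range(K):
        for j in range(i + 1, K):
            D[i, j] = D[j, i] = bhattacharyya_distance(preds_by_task[i],
                                                       preds_by_task[j])
    return D

# Usage with synthetic stand-ins for models trained on 6 hypothetical tasks:
rng = np.random.default_rng(1)
preds = rng.dirichlet(np.ones(10), size=(6, 300))  # (tasks, samples, classes)
D = task_distance_matrix(preds)
# Condensed upper triangle for SciPy, then average-linkage clustering.
condensed = D[np.triu_indices(6, k=1)]
tree = linkage(condensed, method="average")
print(dendrogram(tree, no_plot=True)["ivl"])       # leaf order of the task tree
```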